Abstract. We prove that $O(e^{\sqrt{n \log n}})$ states are sufficient to simulate an n-state 1nfa recognizing
a unary language by a 1dfa. The lower bound is the same. Similar tight bounds are shown for
the simulation of a 2dfa by a 1dfa and a 1nfa. We also show that $O(n^2)$ states are sufficient and
necessary to simulate an n-state 1nfa recognizing a unary language by a 2dfa.


1. Notation 


By an fa we denote a finite automaton. Using appropriate prefixes we specify 
what kind of an fa we consider. The possible prefixes are formed of the symbols: 
- 1: one-way,
- 2: two-way,
- d: deterministic,
- n: nondeterministic,
- a: alternating.

For example, a 2dfa is a two-way deterministic finite automaton. A unary language 
is a language over a one-symbol alphabet. A unary fa is an fa with a one-symbol 
input alphabet. Clearly, a unary fa recognizes a unary language. Also, if an fa 
recognizes a unary language, then we can make it unary by deleting unnecessary 
symbols and modifying the next-state function. 

We will consider only unary fa’s. In this case, input words can be identified with
nonnegative integers and we will write x instead of $a^x$. Moreover, a unary 1nfa is
simply a digraph whose vertices are states and whose edges correspond to the
next-state function. Therefore, we can talk of vertices, edges, cycles, strongly
connected components, etc. of a 1nfa. We will describe a 1nfa as a quadruple
$(Q, q_0, E, F)$, where $Q$, $q_0$, $F$ are as usual and $E$ is the set of edges corresponding
to the next-state function. As usual, L(A) is the language accepted by an fa A. A
sweeping 2nfa is a 2nfa making reversals only at the endmarkers.

All logarithms are to base e. Sometimes we will write exp(x) instead of $e^x$.

By $\gcd(x_1, \ldots, x_k)$ we denote the greatest common divisor of $x_1, \ldots, x_k$ and by
$\operatorname{lcm}(x_1, \ldots, x_k)$ their least common multiple. The following function will play
a basic role in our investigations:


$$F(n) = \max\{\operatorname{lcm}(x_1, \ldots, x_k) \mid x_1 + \cdots + x_k = n\}.$$

For $a_1, \ldots, a_p$ such that $\gcd(a_1, \ldots, a_p) = 1$ we denote by $G(a_1, \ldots, a_p)$ the
greatest number b such that the Diophantine equation $a_1 x_1 + \cdots + a_p x_p = b$ has no
solution in natural numbers.

By H(n) we denote the function $H(n) = e^{\sqrt{n \log n}}$.


2. Introduction 


In this paper we investigate the following classical problem of automata
theory: given two classes of fa’s $C_1$, $C_2$, how many states are necessary and sufficient
to simulate n-state automata from $C_1$ by automata in $C_2$? It is well known that all
‘reasonable’ fa’s, even 2afa’s, recognize only regular sets [2, 5, 12, 21]. However,
stronger fa’s can describe a given language using fewer states. A fundamental theorem
about one-way fa’s is that $2^n$ states suffice to simulate any n-state 1nfa by a 1dfa
and there are examples showing that this bound cannot be improved [7, 16, 18, 21,
29]. Other known bounds are:

- each n-state 2dfa can be simulated by a 1dfa with $O(n^n)$ states [1, 2, 21, 26];
- each n-state 2nfa can be simulated by a 1dfa with $O(2^{n^2})$ states [2, 21];
- each n-state 1afa can be simulated by a 1dfa with $O(2^{2^n})$ states [5].

It is also known that these bounds are asymptotically best possible [1, 2, 5, 18, 21]. 

The problem of simulating 1nfa’s (or 2nfa’s) by 2dfa’s was stated by Sakoda and
Sipser in [22] and still remains open. Berman and Lingas [2] combine bounds for
the simulation of 2dfa’s and 2nfa’s by 1dfa’s to show that, in general, $\Omega(n^2/\log n)$
states are necessary to simulate a 2nfa by a 2dfa. Also, as was shown by Sipser
[27], sweeping 2dfa’s (that is, 2dfa’s making reversals only at the endmarkers)
require $2^n$ states to simulate 1nfa’s. Sakoda and Sipser conjecture that this also
holds for all 2dfa’s. The problem is of interest not only in its own right. The following
theorem, due to Berman and Lingas [2], relates it to the DLOG = NLOG problem:
If DLOG = NLOG, then there is a polynomial p such that, for each integer m and
each n-state 2nfa A, there is a p(mn)-state 2dfa B such that $L_{mn}(A) = L_{mn}(B)$,
where $L_k(A)$ is the set of words in L(A) of length at most k.

In this paper we consider the particular case of the above problems when the 
languages considered are unary. The problem was mentioned by Sipser [27]. Note 
that the proofs of lower bounds in the general case essentially use the fact that the 
alphabet consists of at least two letters. The proofs, based on the Myhill-Nerode 
theorem, are of information-theoretic nature: usually, it is shown that a 1dfa with 
too few states is not able to carry enough information through the input word. In 
case of unary languages, as we will show later, we face number-theoretic problems 
which, fortunately, are now quite satisfactorily solved. Using these number-theoretic 
methods we prove that 

(1) each unary n-state 1nfa can be simulated by a 1dfa with $O(H(n))$ states and
this bound is asymptotically optimal;

(2) each unary n-state 2dfa can be simulated by a 1dfa with $O(H(n))$ states and
this bound is asymptotically optimal;

(3) each unary n-state 1nfa can be simulated by a 2dfa with $O(n^2)$ states and
this bound is asymptotically optimal.

A weaker result similar to (1) was obtained by Liubicz in [15]. He proved that, 
in our notation, the upper bound in (1) is O(nH(n)). The proof in [15] is matrix- 
oriented, using some results about positive matrices. Our method is purely com- 
binatorial. We transform each 1nfa to the normal form (Lemma 4.3), from which 
we derive (1) and (3). The normal-form lemma has interest of its own since it says 
that each unary 1nfa A can be substituted by an equivalent 1nfa A’ making only
one guess, and the size of A’ is bounded by a small-degree polynomial of the size 
of A. 

In (2), the lower bound applies also to 1nfa’s instead of 1dfa’s. The $\Omega(n^2)$ lower
bound in (3) is the best lower bound for the simulation of 2nfa’s by 2dfa’s we know
of. It strengthens the mentioned result of Berman and Lingas [2] for 2nfa’s, not
only because the function is of higher order but also because it concerns very simple
2nfa’s, namely unary 1nfa’s. It is amazing that the proof is so simple. Unfortunately,
our technique cannot give better lower bounds.

Unary languages have already been studied in the theory of automata and 
complexity. For example, they can be used as witness languages for proving separ- 
ation results about space complexity classes [8, 11, 24, 25], or for solving the ‘k+1 
versus k heads’-problem for multihead automata [20]. Also, some known open 
problems, as, for example, the LBA problem, can be reduced to problems about 
unary languages [19]. In [8, 11], nonregular unary languages of space complexity 
O(log log n) were constructed. Unary languages have also been studied in the theory 
of AFLs [9]. This research was partially motivated by the above results. 

Except for results using diagonalization, it seems almost a rule that when dealing
with unary languages, one arrives at number-theoretic problems [8, 9, 10, 11]. For
example, the proofs that the languages constructed in [8, 11] are of space complexity
O(log log n) use results from analytic number theory.


3. Two number-theoretic problems 


The problem of finding a good approximation for F(n) is known as Landau’s
problem [13, 14, 28, 30]. The problem is usually stated in terms of permutation
groups: what is the maximal order in $S_n$, the symmetric group on n symbols? (The
order of P in $S_n$ is the order of the cyclic subgroup generated by P.) Landau [13, 14]
has proved that


$$\lim_{n \to \infty} \frac{\log F(n)}{\sqrt{n \log n}} = 1.$$
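
As a small numerical illustration of Landau's limit, the following sketch evaluates F(n) for a few values of n and prints the ratio of log F(n) to the square root of n log n. It is only a sketch under stated assumptions: the names `primes_up_to` and `landau` are hypothetical, and the dynamic programme over prime powers is a standard way to evaluate F(n), not a method taken from this paper. It relies on the observation that padding a partition with 1's does not change the lcm, so maximising over sums at most n gives the same value as over sums exactly n.

```python
import math

def primes_up_to(n):
    """Return all primes <= n by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1)
            if all(p % d for d in range(2, math.isqrt(p) + 1))]

def landau(n):
    """F(n): the largest lcm of positive integers whose sum is n."""
    # best[t] = largest product of powers of distinct primes with sum <= t
    best = [1] * (n + 1)
    for p in primes_up_to(n):
        # Descending totals ensure each prime contributes at most one power.
        for total in range(n, p - 1, -1):
            q = p
            while q <= total:
                best[total] = max(best[total], best[total - q] * q)
                q *= p
    return best[n]

for n in (10, 50, 200):
    f = landau(n)
    print(n, f, math.log(f) / math.sqrt(n * math.log(n)))
```

The printed ratio gives a feeling for how slowly the limit is approached; no particular numerical values are claimed here.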


The best known approximation is due to Szalay: 


Theorem A ([28]).
$$F(n) = \exp\Bigl[\bigl(n\,(\log n + \log\log n - 1 + (\log\log n - 2 + o(1))/\log n)\bigr)^{1/2}\Bigr].$$

For our purpose the bound in the corollary below will be sufficient.


Corollary A. F(n)=O(H(n)). 


The second problem concerns linear Diophantine equations. First consider 
equations with two variables. The following well-known fact will be used in our 
proofs. 


Fact A. If $\gcd(a, b) = 1$, then the greatest number c such that the equation $ax + by = c$
has no solution in natural numbers is $(a-1)(b-1)-1$.
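
Fact A is easy to check by brute force for small coprime pairs. The sketch below is an illustration only; the name `largest_gap` is hypothetical, the search bound `a * b` is an assumption that is safe whenever Fact A holds, and the code assumes a, b are at least 2 so that some number is not representable.

```python
from math import gcd

def largest_gap(a, b):
    """Largest c with no solution of a*x + b*y = c in natural numbers."""
    assert a >= 2 and b >= 2 and gcd(a, b) == 1
    limit = a * b                      # search bound (assumed large enough)
    rep = [False] * (limit + 1)
    rep[0] = True                      # 0 = a*0 + b*0
    for c in range(1, limit + 1):
        # c is representable iff removing one a or one b leaves a representable number
        rep[c] = (c >= a and rep[c - a]) or (c >= b and rep[c - b])
    return max(c for c in range(limit + 1) if not rep[c])

for a, b in [(3, 5), (4, 7), (9, 10)]:
    print(a, b, largest_gap(a, b), (a - 1) * (b - 1) - 1)
```

Each printed line shows the brute-force value next to the closed form from Fact A.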


Frobenius stated the problem of generalizing this result, that is, of finding a good
approximation for $G(a_1, \ldots, a_p)$. There are quite a lot of papers on Frobenius’s
problem, although the known approximations still seem far from the exact value.
We will use the following result.


Theorem B ([3, 15]). Let $a_1 < \cdots < a_p$ and $\gcd(a_1, \ldots, a_p) = 1$. Then we have
$G(a_1, \ldots, a_p) < (a_1 - 1)(a_p - 1)$.


Erdős and Graham [6] give a more accurate approximation, as well as some more
references to the problem. The corollary below follows from Theorem B. 


Corollary B. Let $a_1, \ldots, a_p$ be natural numbers $\le n$. Let X be the set of all x’s for
which the Diophantine equation $a_1 x_1 + \cdots + a_p x_p = x$ is solvable in natural numbers.
Then the set of numbers in X greater than $n^2$ is an arithmetic progression with period
$\gcd(a_1, \ldots, a_p)$.
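
The statement of Corollary B can be tested numerically on small instances. The following is a minimal sketch, assuming a finite search horizon; the names `representable` and `check_corollary_b` and the parameter `horizon` are hypothetical and serve only this illustration.

```python
from functools import reduce
from math import gcd

def representable(coins, limit):
    """rep[x] is True iff x is a natural-number combination of the coins."""
    rep = [False] * (limit + 1)
    rep[0] = True
    for x in range(1, limit + 1):
        rep[x] = any(x >= a and rep[x - a] for a in coins)
    return rep

def check_corollary_b(coins, n, horizon):
    """Check on the range n^2 < x <= horizon that x is representable
    exactly when gcd(coins) divides x."""
    assert all(1 <= a <= n for a in coins)
    d = reduce(gcd, coins)
    rep = representable(coins, horizon)
    return all(rep[x] == (x % d == 0) for x in range(n * n + 1, horizon + 1))

print(check_corollary_b([4, 6], 6, 500))
print(check_corollary_b([6, 10, 15], 15, 1000))
```

A finite check of this kind is of course no proof; it only confirms the predicted arithmetic progression up to the chosen horizon.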


4. 1nfa versus 1dfa


In this section, we will present tight bounds for the simulation of unary 1nfa’s 
by 1dfa’s. 


Definition 4.1. Let A be a unary 1nfa such that r of its vertices are in cycles and s
of them are not (so r + s is the number of A’s states). Then we define S(A) = (r, s).


Definition 4.2. A unary 1nfa $A = (Q, q_0, E, F)$ is in normal form if it has the following
properties:
(a) $Q = \{q_0, \ldots, q_m\} \cup C_1 \cup \cdots \cup C_k$,
    where $C_i = \{p_{i,0}, p_{i,1}, \ldots, p_{i,y_i-1}\}$ for $i = 1, \ldots, k$,
(b) $E = \{(q_i, q_{i+1}) \mid i = 0, \ldots, m-1\}$
    $\cup\ \{(p_{i,j}, p_{i,j+1}) \mid i = 1, \ldots, k \text{ and } j = 0, \ldots, y_i - 1\}$
    $\cup\ \{(q_m, p_{i,0}) \mid i = 1, \ldots, k\}$
    (the addition j+1 in the second component is mod $y_i$).

Informally, A consists of a path from $q_0$ to $q_m$ and cycles $C_1, \ldots, C_k$ connected
to $q_m$. An example of an automaton in normal form is shown in Fig. 1, for m=4,
k=3, $y_1=4$, $y_2=5$, $y_3=3$.


Lemma 4.3. For each unary n-state 1nfa A there is an equivalent 1nfa A’ in normal
form such that $S(A') = (n, O(n^2))$.


Proof. Let $A = (Q, q_0, E, F)$. Without any loss of generality, we can assume that
$F = \{q_F\}$. A superpath in A is a subgraph of A denoted by


$$\alpha = P_1 D_1 P_2 D_2 \ldots P_t D_t P_{t+1},$$


where 

(a) for $i = 1, \ldots, t$, $D_i$ is a strongly connected component of A;

(b) for $i = 1, \ldots, t+1$, $P_i$ is a path in A whose inner points do not belong to
strongly connected components of A;

(c) the first vertex of $P_1$ is $q_0$, the last vertex of $P_{t+1}$ is $q_F$;

(d) for $i = 1, \ldots, t$, the last vertex of $P_i$ belongs to $D_i$;

(e) for $i = 2, \ldots, t+1$, the first vertex of $P_i$ belongs to $D_{i-1}$.

Let $L_\alpha$ be the set of all lengths of paths from $q_0$ to $q_F$ in $\alpha$. Let $\Pi$ be the set of
all superpaths in A. Then L(A) is the union of all sets $L_\alpha$ for $\alpha$ in $\Pi$.

For a strongly connected component D of A, let $\Pi(D)$ be the set of all superpaths
$\alpha$ such that D is the last strongly connected component in $\alpha$. Let $\Pi_0$ be the set of
all superpaths which do not contain strongly connected components (that is, simple
paths from $q_0$ to $q_F$). Then

$$L(A) = \bigcup_{\alpha \in \Pi_0} L_\alpha \;\cup\; \bigcup_{D} \bigcup_{\alpha \in \Pi(D)} L_\alpha,$$

where the second union is taken over all strongly connected components D in A.


Fig. 1. A unary 1nfa in normal form.

Let us now fix a strongly connected component D and a superpath $\alpha$ in $\Pi(D)$,
$\alpha = P_1 D_1 P_2 D_2 \ldots P_t D_t P_{t+1}$, $D_t = D$. Let $\gcd(\alpha)$ be the greatest common divisor of
the lengths of the cycles in $\alpha$, and $\gcd(D)$ the greatest common divisor of the lengths
of the cycles in D. Clearly, $\gcd(\alpha)$ divides $\gcd(D)$.

Let $x \in L_\alpha$ and let R be the path in $\alpha$ of length x. Then $x = x_0 + a_1 x_1 + \cdots + a_p x_p$,
where $x_0$ is the length of the path obtained from R by deleting the cycles, and
$a_1, \ldots, a_p$ are the lengths of all cycles in $\alpha$. Let $m = n^2 + n$. By Corollary B we obtain
that $L_\alpha = L_\alpha^1 \cup L_\alpha^2$, where $L_\alpha^1$ is the subset of $L_\alpha$ containing numbers $< m$, and $L_\alpha^2$
is an arithmetic progression with period $\gcd(\alpha)$. It is easy to see that we could
substitute for $\alpha$ an ‘equivalent’ subgraph consisting of a single path and a cycle of
length $\gcd(\alpha)$ attached to it. Instead we will use a cycle of length $\gcd(D)$, so, in
this way, it can be used as a common cycle for all superpaths in $\Pi(D)$.

Let $T_1, \ldots, T_k$ be all strongly connected components of A in some fixed order.
Then A’ will have the set of states as in Definition 4.2, where m and k are the
numbers defined above and $y_i = \gcd(T_i)$, $i = 1, \ldots, k$. We only have to mark appropriate
states as accepting. Clearly, $q_x \in F'$ iff $x \in L_\alpha^1$ for some superpath $\alpha$ (for $\alpha \in \Pi_0$
we set $L_\alpha^1 = L_\alpha$). Similarly, $p_{i,x} \in F'$ iff $x + m \in L_\alpha^2$ for some superpath $\alpha \in \Pi(T_i)$,
$i = 1, \ldots, k$. □


Theorem 4.4. For each unary n-state 1nfa A there is an equivalent 1dfa B with
O(H(n)) states.


Proof. For a given 1nfa A with n states, we first construct a 1nfa A’ in normal form
such that $S(A') = (r, s)$, for $r \le n$ and $s = O(n^2)$, as in Lemma 4.3. Let $y_1, \ldots, y_k$ be
the lengths of cycles $C_1, \ldots, C_k$ and $y = \operatorname{lcm}(y_1, \ldots, y_k)$. Then $B = (Q, q_0, E, F)$,
where


$$Q = \{q_0, \ldots, q_{s-1}, q_s, \ldots, q_{y+s-1}\},$$
$$E = \{(q_i, q_{i+1}) \mid i = 0, 1, \ldots, y+s-2\} \cup \{(q_{y+s-1}, q_s)\}$$


and F is defined as follows. If $q_i$, $0 \le i < s$, is an accepting state in A’, then $q_i \in F$.
Also, if some $p_{i,j}$ is accepting in A’, then $q_{s+t} \in F$ for each t such that $t - j = c y_i$ for
some integer c. It is straightforward to check that L(B) = L(A). B has $y + s =
O(F(n)) + O(n^2) = O(F(n))$ states. Using Corollary A we then obtain the
theorem. □
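
The shape of the 1dfa B (a path followed by a single loop) can also be observed with the textbook subset construction, which over a one-letter alphabet always produces a path of subsets leading into a cycle of subsets. The sketch below uses that generic construction, not the normal-form construction of the proof; the names `unary_determinize` and `cycles_nfa` are hypothetical, accepting states are ignored, and the result counts reachable subsets rather than states of a minimal automaton.

```python
def unary_determinize(edges, start):
    """Subset construction over a one-letter alphabet.

    edges maps a state to the set of its successors. Returns (tail, cycle):
    the reachable subsets form a path of `tail` subsets followed by a loop
    of `cycle` subsets.
    """
    seen = {}
    current = frozenset([start])
    step = 0
    while current not in seen:
        seen[current] = step
        current = frozenset(t for s in current for t in edges.get(s, ()))
        step += 1
    return seen[current], step - seen[current]

def cycles_nfa(lengths):
    """Hypothetical helper: an initial state 'q0' with an edge into each of
    several disjoint cycles, one cycle per entry of `lengths`."""
    edges = {"q0": {(i, 1 % y) for i, y in enumerate(lengths)}}
    for i, y in enumerate(lengths):
        for j in range(y):
            edges[(i, j)] = {(i, (j + 1) % y)}
    return edges

print(unary_determinize(cycles_nfa([2, 3]), "q0"))  # (1, 6)
```

For cycle lengths 2 and 3 the loop has length 6, the lcm of the cycle lengths, which matches the role played by y in the proof.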


Theorem 4.5. For each n there is a unary n-state 1nfa A such that each 1dfa recognizing
L(A) requires $\Omega(H(n))$ states.


Proof. We will show that F(n−1) states are necessary. Let n be arbitrary but fixed
and $x_1, \ldots, x_k$ be the numbers for which the maximum in the definition of F(n−1)
is attained. W.l.o.g. we can assume that $x_1 < \cdots < x_k$. From the properties of F(n)
it is also known that they are relatively prime in pairs. Let also


$$L = \{c x_i \mid i = 1, \ldots, k,\ c \in \mathbb{N}\}.$$

Consider a 1nfa $A = (Q, q_0, E, F)$ such that

$$Q = \{q_0\} \cup \{p_{i,j} \mid 1 \le i \le k,\ 0 \le j < x_i\},$$
$$F = \{p_{i,0} \mid 1 \le i \le k\},$$
$$E = \{(q_0, p_{i,1}) \mid i = 1, \ldots, k\} \cup \{(p_{i,j}, p_{i,j+1}) \mid i = 1, \ldots, k,\ j = 0, \ldots, x_i - 1\}$$

(the addition is mod $x_i$).


So A consists of the initial state and several cycles. Obviously, L(A) = L and A has 
n states. 

Now, let R be the Myhill-Nerode equivalence relation: R(u, v) iff ($u + z \in L$ iff
$v + z \in L$ for each z).

We will show that the index of R is at least y = F(n−1). In order to do this we
will prove that, for each two different $0 \le u, v < y$, it is not true that R(u, v) holds.
We must find a z such that exactly one of u+z, v+z belongs to L.

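If $2 \le i \le k$, or $i = 1$ and $x_1 \ne 2$, then we define

$$z_i = \begin{cases}
0 & \text{if } u_i \ne 0 \text{ and } v_i \ne 0, \\
1 & \text{if } u_i = 0,\ 0 \le v_i < x_i - 1 \text{ or } v_i = 0,\ 0 \le u_i < x_i - 1, \\
2 & \text{if } u_i = 0,\ v_i = x_i - 1 \text{ or } v_i = 0,\ u_i = x_i - 1,
\end{cases}$$

where $u_i = u \bmod x_i$, $v_i = v \bmod x_i$. If $x_1 = 2$, then

$$z_1 = \begin{cases}
0 & \text{if } u_1 \ne v_1 \text{ or } u_1 = v_1 = 1, \\
1 & \text{if } u_1 = v_1 = 0.
\end{cases}$$

By the Chinese Remainder Theorem there is exactly one z such that, for $i =
1, 2, \ldots, k$, $z \bmod x_i = z_i$. Then, by the definition of $z_i$, we have that $(u + z) \bmod x_i \ne 0$
and $(v + z) \bmod x_i \ne 0$ for $i = 2, 3, \ldots, k$.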

Suppose first that $x_1 = 2$ and $u_1 \ne v_1$. Then, either $u_1 = 0$ and $v_1 = 1$, or $u_1 = 1$ and
$v_1 = 0$. In both cases we have that exactly one of u+z, v+z belongs to L.

Otherwise, let j be the smallest number such that $u_j \ne v_j$. W.l.o.g. we can assume
that $(u + z) \bmod x_j > (v + z) \bmod x_j$. Let $t_i = 0$ for $i \ne j$ and $t_j = x_j - (u + z) \bmod x_j$. Then
again by the Chinese Remainder Theorem, there is exactly one t such that $t \bmod x_i =
t_i$ for $i = 1, \ldots, k$. Then $u + z + t \in L$, but $v + z + t \notin L$. □
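
The claim that all residues below y are pairwise inequivalent can be confirmed experimentally for small moduli. The sketch below is an illustration under an explicit simplification: it treats L as the purely periodic set of all x divisible by some modulus (so 0 counts as a member), in which case the behaviour of u is determined by the memberships of u + z for z below the period. The name `nerode_index` is hypothetical.

```python
from functools import reduce
from math import gcd

def nerode_index(moduli):
    """Number of Myhill-Nerode classes of {x : some modulus divides x}."""
    y = reduce(lambda a, b: a * b // gcd(a, b), moduli)  # lcm of the moduli
    in_l = [any(x % m == 0 for m in moduli) for x in range(y)]
    # The set has period y, so the class of u is fixed by u+z for 0 <= z < y.
    signatures = {tuple(in_l[(u + z) % y] for z in range(y))
                  for u in range(y)}
    return len(signatures)

print(nerode_index((2, 3)))     # 6
print(nerode_index((3, 4, 5)))  # 60
```

In both cases the number of classes equals the lcm of the pairwise coprime moduli, as the proof predicts.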


5. 2dfa versus 1dfa

Theorem 5.1. Each unary n-state 2dfa can be simulated by a 1dfa with $O(H(n))$ states.

Proof. It is easy to show that any unary 2dfa can be substituted by an equivalent
sweeping 2dfa without increasing the number of its states. So let $A = (Q, q_0, \delta, F)$
be a sweeping unary 2dfa with n states. A 1dfa B simulating A on input x first
checks if x is a word of length $\le n$ accepted by A. If x is longer than n, A must make a cycle on
each pass on x. Let $y_1, \ldots, y_k$ be the lengths of all of A’s cycles. Clearly, $y_1 + \cdots +
y_k \le n$, because no state can be in two different cycles. For two numbers $v > u > n$
such that $v - u = y = \operatorname{lcm}(y_1, \ldots, y_k)$, A accepts u iff A accepts v. Therefore, a cycle
of length y suffices to simulate A on words longer than n. Now the theorem follows
from Corollary A. □


Theorem 5.2. For each n there is a unary n-state 2dfa A such that each 1dfa recognizing
L(A) requires $\Omega(H(n))$ states.


Proof. The theorem follows from the fact that the language L from Theorem 4.5
can be recognized by a 2dfa A with n states. A simply makes k passes over an input
x computing $x \bmod x_i$ in the ith pass, $i = 1, 2, \ldots, k$. □
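
The multi-pass strategy can be imitated directly. The sketch below is a loose illustration, not a formal 2dfa: each pass is modelled as a loop that only remembers a counter modulo one of the moduli, and the state counts returned by the hypothetical helper `state_counts` are rough (the sum of the moduli for the two-way machine against their lcm for a one-way deterministic one). The name `sweeping_accepts` is likewise an assumption of this example.

```python
from functools import reduce
from math import gcd

def sweeping_accepts(x, moduli):
    """Simulate k passes over a unary input of length x, one modulus per pass."""
    for m in moduli:
        residue = 0
        for _ in range(x):            # one sweep across the input
            residue = (residue + 1) % m
        if residue == 0:              # x is a multiple of this modulus
            return True
    return False

def state_counts(moduli):
    """Rough state counts: per-pass counters versus a single lcm-sized loop."""
    two_way = sum(moduli)
    one_way = reduce(lambda a, b: a * b // gcd(a, b), moduli)
    return two_way, one_way

print(sweeping_accepts(9, (2, 3)), sweeping_accepts(7, (2, 3)))
print(state_counts((3, 4, 5)))
```

The gap between the two numbers returned by `state_counts` is what the lower bound exploits.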


6. 2dfa versus 1nfa


First we strengthen Theorem 5.2 by showing that the lower bound even holds for
1nfa’s.


Theorem 6.1. For each n there is a unary n-state 2dfa A such that each 1nfa recognizing
L(A) requires $\Omega(H(n))$ states.


Proof. Let $x_1, \ldots, x_k$ be the numbers for which the maximum in the definition of
F(n) is attained. Let also $L = \{c F(n) \mid c \in \mathbb{N}^+\}$. There is a 2dfa A with n states
recognizing L. A behaves similarly to the automaton from the proof of Theorem
5.2, except that it accepts x iff $x \bmod x_i = 0$ for each $i = 1, 2, \ldots, k$. The shortest
word in L is F(n). Consider a 1nfa B recognizing L. B must have a simple path
of length at least F(n) between the starting state and a final state, because otherwise
it would accept a word shorter than F(n). This proves the theorem. □


Theorem 6.2. Each unary n-state 1nfa A can be simulated by a 2dfa B with $O(n^2)$ states.


Proof. For a given 1nfa A with n states we construct a 1nfa A’ in normal form such
that $S(A') = (r, s)$, for $r \le n$ and $s = O(n^2)$, as in Lemma 4.3. A 2dfa B simulating
A’ first checks if an input x is $< s$. If so, B accepts iff $q_x$ is an accepting state of A’
(the notation is from Lemma 4.3). Otherwise, B makes k passes over the input and
the length of the cycle in the jth pass is $y_j$. So, in the jth pass, B computes
$t = (x - s) \bmod y_j$ and accepts x iff $p_{j,t}$ is an accepting state of A’. This completes the
proof, because B has $r + s = O(n^2)$ states. □
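
The decision procedure of B can be written down in a few lines. The following is a minimal sketch, assuming the normal-form automaton is given explicitly; the names `two_way_accepts` and `state_budget` and the encoding of the automaton (a list of flags for the path states and a list of cycle lengths with accepting positions) are hypothetical choices made for this example.

```python
def two_way_accepts(x, path_accepting, cycles):
    """Decide membership the way B does in the proof.

    path_accepting[i] tells whether q_i is accepting, so s = len(path_accepting);
    cycles is a list of pairs (y_j, set of t such that p_{j,t} is accepting).
    """
    s = len(path_accepting)
    if x < s:
        return path_accepting[x]
    # One pass per cycle: pass j needs only a counter modulo y_j.
    return any((x - s) % y in residues for y, residues in cycles)

def state_budget(path_accepting, cycles):
    """s + r: path states plus one counter state per cycle vertex."""
    return len(path_accepting) + sum(y for y, _ in cycles)

# Positive multiples of 2 or 3: a one-state path and cycles of lengths 2 and 3.
path = [False]
cycles = [(2, {1}), (3, {2})]
print([x for x in range(13) if two_way_accepts(x, path, cycles)])
print(state_budget(path, cycles))
```

Here the state budget is the sum of the cycle lengths plus the path length, in contrast to the lcm needed by a one-way deterministic automaton.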


Theorem 6.3. For each n there is a unary n-state 1nfa A such that each 2dfa recognizing
L(A) requires $\Omega(n^2)$ states.

Proof. Let $L = \{x \mid x = n x_1 + (n-1) x_2 \text{ for } x_1, x_2 \in \mathbb{N}\}$. L can be recognized by a 1nfa
A with n states. $A = (Q, q_0, E, F)$ is defined as follows: $Q = \{q_0, \ldots, q_{n-1}\}$, $E =
\{(q_i, q_{i+1}) \mid i = 0, \ldots, n-1\} \cup \{(q_1, q_3)\}$ (the addition is mod n) and $F = \{q_0\}$. Let
$m = \max(\mathbb{N} \setminus L)$. By Fact A, $m = \Omega(n^2)$. Consider a 2dfa B recognizing L and its
computation on m. Suppose that, in all passes on m, B enters a cycle and let
$y_1, \ldots, y_k$ be the lengths of these cycles. Then B would reject also $m' =
m + \operatorname{lcm}(y_1, \ldots, y_k)$, which contradicts the fact that $m' \in L$. Therefore, there is a pass
of B on m without a cycle and the theorem follows. □
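
The quadratic size of m can be observed by simulating the automaton A from the proof. The sketch below is an illustration only: it assumes n is at least 3, reads the extra edge as going from state 1 to state 3 (taken mod n), and searches for rejected lengths up to the assumed horizon n^2, which exceeds the value predicted by Fact A. The name `largest_rejected` is hypothetical.

```python
def largest_rejected(n):
    """Largest input length rejected by the n-state 1nfa of the proof (n >= 3)."""
    edges = {i: {(i + 1) % n} for i in range(n)}
    edges[1].add(3 % n)                  # the extra edge (q_1, q_3)
    current, rejected = {0}, []
    for x in range(n * n + 1):           # assumed search horizon
        if 0 not in current:             # F = {q_0}
            rejected.append(x)
        current = {t for s in current for t in edges[s]}
    return max(rejected)

for n in (4, 5, 8):
    print(n, largest_rejected(n), (n - 1) * (n - 2) - 1)
```

Each line shows the simulated value next to $(n-1)(n-2)-1$, the value Fact A gives for the coprime pair n and n−1.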


7. Final remarks 


Informally speaking we have shown that 1nfa’s and 2dfa’s are hard to simulate 
by 1dfa’s, even if we consider only unary languages. Also, for unary languages,
two-way motion is more powerful, in a sense, than guessing, because we can simulate 
unary 1nfa’s by 2dfa’s increasing the number of states only polynomially, which is 
not possible the other way round. Comparing it with the upper bound for 1dfa’s 
one can say that nondeterminism does not help if we want to simulate 2dfa’s by 
one-way fa’s. 

The following problems are still open: 

(1) (The Sakoda-Sipser problem for unary languages.) Does there exist a poly- 
nomial p such that each unary n-state 2nfa can be simulated by a p(n)-state 2dfa? 
Actually, the problem of Sakoda and Sipser concerns also 1nfa’s, but as we have 
shown in Section 6, in this case the answer is positive. 

(2) What is the relationship between unary 1afa’s (or 2afa’s) and other fa’s? It
is easy to show some lower and upper bounds for 1afa’s with only universal states.
The author believes that unary 1afa’s and 2dfa’s are polynomially equivalent.